An 0(log(n)) Fully Dynamic Algorithm for Maximum 

matching in a tree 

Manoj Gupta* Ankit Sharma^ 
Indian Institute of Technology Kanpur, India 

January 19, 2009 



Abstract 

In this paper, we have developed a fully-dynamic algorithm for maintaining cardinality of 
maximum-matching in a tree using the construction of top-trees. The time complexities are as 
CsJ follows: 
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1. Initialization Time: 0{n{log{n))) to build the Top-tree. 

2. Update Time: 0{log{n)) 

3. Query Time: 0(1) to query the cardinality of maximum-matching and 0{log{n)) to find 
if a particular edge is matched. 

1 Introduction 



Dynamic graph algorithms aim to maintain certain properties in a graph under insertion and/or 
^ deletion of edges or vertices from the graph. The motivation behind dynamic algorithms is to 
^ maintain the graph property without the need to compute from scratch after each update (insertion 
O or deletion). A dynamic algorithm is incremental and decremental if it can handle the cases of 
^ insertion and deletion respectively. A dynamic algorithm is fully-dynamic if it is both incremental 
• and decremental. Dynamic graph algorithms for maintaining connectivity and mimimum spanning 
O tree in a graph have been well studied. 

This paper presents a dynamic algorithm for maintaining maximum matching in a dynamic tree. 
Given a graph G(V, E), maximum matching gives the subset of E of maximum cardinality such that 
no two edges in the subset share a vertex. The best static algorithm for finding maximum matching 
^ in a general graph has 0(|-E'|y'|y|) time complexity [1]. The static algorithm for finding maximum- 
matching in a tree has 0{n) time complexity. The algorithm involves randomly choosing a leaf node 
and making the edge, incident on the leaf, matched. The matched edge is then deleted from the 
tree along with all the edges which are incident on the node to which the leaf node was attached. 
The process is recursively followed on the new tree till no nodes are left. The time complexity of 
the algorithm is 0{n), n being the number of nodes in the tree, since in each iteration atleast one 
node is removed and number of edges in a tree are n — 1. We omit the proof of correctness of this 
algorithm. 

Among dynamic algorithm for maximum- matching in a graph, the best time complexity is of a 
randomized algorithm given by Piotr Sankowski which has time complexity of 0(n^'^^^) [2]. In this 
paper, we give an O(logn) time algorithm to maintain maximum matching in a dynamic tree under 
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Figure 1: The figure on the left shows partition of the tree into three clusters P, Q and R, having 
boundary nodes (a, 6), {b,h) and {h,j) respectively. For instance, cluster Q is the subtree spanned 
between b and h. Clusters P and R are leaf-clusters. The figure on the right shows the equivalent 
top-tree representation of the tree partition wherein each cluster of the tree is denoted by an edge 
with end points of the edge being the boundary nodes of the corresponding cluster. For example, 
cluster Q is represented by an edge having end points b and h. 



insertion and deletion of edges, n being the number of nodes in the tree. We make use of top-tree, 
introduced in section 2, to represent the dynamic tree. The algorithm, the proof of correctness and 
time-complexity analysis are presented in subsequent sections. 



2 Top Tree 

Top-tree [-j] partitions a tree into clusters, a cluster being defined as a connected subtree of the given 
tree. In figure 1, P, Q and R are clusters of the tree. A top-tree has several levels and each level 
induces a partitioning of the tree into clusters. A higher level in a top-tree partitions the tree into 
smaller number of larger clusters compared to a lower level. To construct level I + 1 from /, the 
clusters at level / are combined according to prescribed rules to form the clusters at the next level in 
the top tree. At the lowest level, each edge of the tree is a cluster. At the highest level, the whole 
tree is seen as a single cluster. The number of levels in a top-tree is of 0{log{n)) where n denotes 
the number of nodes in the tree. The reason behind 0{log{n)) levels is that the cluster combination 
operations and the fact that we are dealing with a tree ensure that each successive level has atmost 
a constant ratio of number of clusters of the previous level. 

The cluster formation rules ensure that each cluster is connected to the rest of the tree at atmost 
two nodes. These two nodes are referred to as 'boundary nodes' of the given cluster. Those clusters 
which share only one node with the rest of the tree are called 'leaf-clusters'. Although a leaf cluster 
has only one node shared with the rest of the tree and therefore in principle should have only one 
boundary node, yet for the sake of uniformity across all clusters, we assign two nodes of a leaf-cluster 
as boundary nodes. One of the boundary nodes is the one through which the leaf-cluster is connected 
to the rest of the tree. The way the other boundary node is decided will be clear from discussion in 
the following paragraphs. At the lowest level of the top-tree, each edge is a cluster. The leaf edges of 
the tree are leaf-clusters. The boundary nodes for all clusters are the end-points of the edge forming 
the cluster. 

As mentioned earlier, clusters, at level /, are combined to form clusters at level /-|- 1 in a top-tree. 
Each cluster at level / -|- 1 is either a combination of two clusters of level / or is same as a cluster 
of level /. To form level / + 1 clusters, we combine clusters of level / in pairs to form level / -|- 1 
clusters till we cannot combine any more clusters. Each cluster of level I can participate in atmost 
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Figure 2: Example of Rake operation: Here cluster P is raked onto cluster Q to form cluster R. 
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Figure 3: Example of compress operation: Here clusters P and Q are compressed to form cluster R. 

one combine operation with another cluster of the same level to form a cluster at level / + 1. 

At any level, more than one cluster may share a common boundary node. In this case, we have an 
ordering of the clusters around the boundary node which is in counterclockwise orientation around 
the node. In the example below, clusters P, Q and R share the boundary node B; P is the successor 
of R and predecessor of Q. We call a cluster 'incident' on a node if the node is a boundary node of 
the cluster and the node is a boundary node of atleast one more cluster. 

A 



C D 

Two clusters can be combined only if they have a successor-predecessor relationship. Further, 
two clusters are combined either by a 'rake' operation or a 'compress' operation. 

1. Rake operation: One of the two clusters is necessarily a leaf cluster and the leaf cluster should 
be the predecessor of the other cluster. As shown in diagram, cluster P has boundary nodes A 
and B and cluster Q has boundary nodes B and C. Cluster P is a leaf cluster and is connected 
to the rest of the tree through B. Cluster Q may or may not be a leaf cluster and is connected 
to rest of the tree through B and C. Cluster P is 'raked onto' cluster Q to form cluster R 
with boundary nodes B and C. Cluster R contains all the edges and nodes which are part of 
clusters P and Q. 



2. Compress operation: Here the two clusters, sharing a common boundary node, should be the 
only clusters incident on the shared boundary node. As shown in diagram, cluster P has 
boundary nodes A and B and cluster Q has boundary nodes B and C. Further, the shared 
boundary node B has no cluster incident on it other than P and Q. Clusters P and Q are 
'compressed' to form cluster R with boundary nodes A and C. Cluster R contains all the edges 
and nodes which are part of clusters P and Q. 



Top-tree represents each cluster, at a given level, by an edge with end-points of the edge being 
the boundary nodes of the cluster. For the clusters which share a boundary node, the edges, corre- 
sponding to the clusters, likewise share the common boundary node. Hence, at each level the cluster 
partition is represented by a tree or a forest formed of the edges representing the clusters. The top 
tree for a tree is shown in figure 4. The tree is not shown explicitly in the figure and is same as 
the tree shown at level of the top-tree. Let {x,y)i denote the edge at level i. At level 0, we have 
the original tree. The edge (&, c)o is raked onto (a, c)o around c to give (a,c)i. The edge {c,d)o and 
{d,e)o is compressed at d to give (c, e)i. These are the only operations at this level. Hence, (a, c)i 
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Figure 4: This figure shows top-tree for the tree given at 1=0 level. In the upper half of the figure, 
the different levels of the top-tree are shown. The lower half shows the underlying partition of the 
tree into clusters at each level. The arrows shown in the construction of top-tree are labeled by r 
and c for rake and compress operation respectively. 

is the parent of (a, c)o and (&, c)o and (c, e)i is the parent of (c, (i)o and ((i, e)o. Cluster-combination 
operations are performed recursively to give successive levels and these levels form the top-tree for 
the given tree. 



2.1 Updation in a Top Tree 

Updation in a top-tree under insertion or deletion of edges can be performed in 0{log{n)) time. 
Addition or deletion of an edge makes some of the cluster-combination operations performed in the 
original top-tree invalid in the modified tree. Further, certain new combination operations might 
arise. We undo the invalid operations of the original top-tree and reconstruct the new top-tree 
level-by-level. 

We explain the updation operation in a top-tree with the help of an example. For a more detailed 
description and the proof for its time complexity, we refer the reader to the Renato F. Werneck's 
thesis[:>]. Let us take the tree constructed in the earlier section and look at what happens when an 
edge ((i, /) is added at the lowest level. The new top tree is given in Figure 5. Let us see how we 
can transform the original top tree above to top tree below. 

We have to add cluster (rf, /)o at the lowest level. This makes the compress operation of (c, d)^ 
and (d, e)o, made in the original top-tree, invalid as the common shared boundary node d now has 
and additional cluster (c?, /)o incident on it. As the original compress operation is invalid, hence at 
level 1, (c, e)i is deleted. Further, a new rake operation of (e,(i)o onto (rf, /)o is performed to give 
((i, /)i at level 1. Edge (c, d)i is also added at level 1 as (c, (i)o could not participate in any combine 
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Figure 5: This figure shows the construction of the new top-tree on addition of the edge {d, /). The 
arrows shown in the construction of top-tree are labeled by r and c for rake and compress operation 
respectively. 



operation at level 0. At level 2, edge (a,e)2 is deleted since its child (c, e)i has been deleted. Also, 
edges {a,d)2 and {d, f)2 arc added. Similarly, wc build rest of the top-tree. 

The construction of the top tree ensures that there are only a constant number of addition and 
deletion operations at each level. Hence, the time to update the top tree is O(logn). The intuitive 
reason behind the constant number of operations at each level is that each cluster is connected to the 
rest of the tree at atmost 2 points which means that any change to this cluster would affect only a 
constant number of clusters at that level namely the clusters which are the predecessor or successor 
of the cluster at its two boundary nodes. This completes our discussion on top-trees. 



3 Algorithm 
3.1 Observations 

We now present an 0{log{n)) update time fully dynamic algorithm for maintaing maximum-matching 
in a tree. We first present a few observations which aid in the development of the algorithm and 
then the algorithm itself. 

Observation 1: If we are able to maintain certain information with each cluster in a top-tree 
such that the information for a parent cluster can be derived from the information stored in its child 
clusters, then any change in information due to insertion or deletion of an edge at the bottom-most 
level of the top-tree can be propagated up the top-tree in as much time as the update operation takes 
for a top-tree. This observation motivates us to store some information with each cluster such that 
it follows the property mentioned above and that it helps in maintaining the maximum-matching. 

Before we present the information which we store with each cluster, let us introduce a few no- 
tations to aid us in this task. In a tree T, let Te denote the set of edges contained in T. We call 
a node 'matched' if any of the edges incident on it are matched. If none of the edges incident on 
a node is matched, we call the node 'unmatched'. If Q be a subtree of tree T and M, a particular 
matching of T, let Aq^m denote the status - matched or unmatched - of node A under matching M 
when restricted to the subtree Q i.e. we consider only those edges which are in Qe and are incident 
on A and in case, any of these edges under consideration are matched under M, we put Aq^m as 
matched and unmatched otherwise. 

In the diagram below, a particular matching M is shown for tree T wherein the dashed edges 
are matched and the solid edges are unmatched. Q is the subtree spanning between A and B. 
Aq,m is unmatched since the only edge under consideration A-ql is unmatched. Further, At,m 
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is matched since out of the two edges, A — tl and A — ql, under consideration, A — tl is matched. 
Again, Bq^m is matched since B — q3, the only edge under consideration, is matched and Bt,m is 
also matched since out of the two edges B — q3 and B — 12, under consideration, B — q3 is matched. 

Subtree Q 




If a node is matched, we call it black (b) and if it is unmatched, we call it white {w). In the 
previous example, Aq^m = ui, A^^m = b, Bq^m = b and B^^m = b. 

As mentioned earlier, each cluster in a top-tree is a sub-tree of the tree T such that it shares 
atmost two nodes with the rest of the tree, which we call the boundary nodes. Let us take a 
particular cluster, say Q, having boundary nodes A and B. Let us consider the set of matchings in 
Q, which make node A matched (6) and B unmatched (w) and consider the matching which has 
the maximum cardinality in this set. We denote the cardinality of this particular matching by 
Mq''"^ where bA and wb denote that A is matched (6) and B is unmatched (w). Mq^^'^ , Mq^^'^ 
and Mq*^^ are similarly defined. 

We now return to our orginal question of what information should be maintained with each cluster 
in a top-tree such that it helps in maintaining maximum-matching and follows the property that the 
information of the parent can be deduced from that of its children. With each cluster Q, we maintain 
the following information: Mq^"'^, M^^"'^, M^^^" and M^^f^'' . 

The reason behind this choice of information is presented below. Essentially, this information 
fulfills the two critera mentioned above. First, the information is sufficient to calculate maximum- 
matching as the cardinality of maximum-matching of Q, denoted by Mq, is the maximum number 
among Mq^"'^, Mq^'^'^ , Mq^^'^ and Mq'^'^ since these four cases exhaust the set of possibihties of 
matchings cases of A and B. 

Secondly, this information obeys the property that the information of the parent cluster can be 
calculated using the information stored in the child-clusters. To see how, we present the following 
observation. 

Observation 2: Let us consider two trees P and Q sharing a common node B as shown in figure. 
The internal structure of the subtrees is not shown in the figure. 



Subtree P Subtree Q 

A B C 

Let tree T denote the union of subtrees P and Q. Say we take a matching M of P and a 
matching N ol Q and produce a matching O of T as follows - under matching O, an edge e e Te 
follows matching imposed by M if e e and follows matching imposed by A^ if e e Q^. Clearly, we 
cannot combine any two arbitrary matchings M and A^, since then we can have two matched edges 
incident on node B, one in Q and the other in P. Hence, we can combine only those matchings of P 
and Q which lead to atmost one matched edge incident on B, in other words, atmost one of -Bp,M 
and Bq^n is black. 
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Let us suppose we have Mp^'"^, Mp'*"'^, Mp^''^ and Mp-^^^ for P and Mq^'"^, Mq^"'^, Mq^''^ 
and Mq^^^ . Can we generate M^^^^ from this available information? We can see that by imposing 
the constraint that we can combine only those matchings of P and Q which yield atmost one matched 
edge on i? in T, we get 

M^^*^ = max{M^^'''' + M^^''^, M"^^"^^ + M^^*^, M]?-^"'^ + M^^''^} (1) 

In the above equation, we can understand that LHS > RHS, as each possibility in RHS is a valid 
combination of matchings of P and Q inferred from the discussion above. The reason for LHS = 
RHS is that the options in RHS exhaust the set of possibilities. Similarly, we can obtain M^"^^^ , 
M^T^""^ and M^-^^^. A gain, Mx would be the maximum of the four numbers. 

Hence, we can see that if we know cardinality of certain constrained maximum-matchings for sub- 
trees P and Q, we can generate the cardinality of constrained maximum-matchings for T. This fact 
is used in the algorithm. With each cluster, we maintain the information of cardinality of the four 
constrained maximum-matchings. The information of a parent cluster can be calculated using the 
information of the child-clusters. To obtain the cardinality of maximum-matching for the whole tree, 
we find the maximum cardinality among the four constrained maximum-matchings for the cluster 
at the highest level of the top-tree. Further, whenever we add or delete an edge, it is an 0(1) time 
operation to update the information for each modified cluster in the top-tree as the information of 
the children clusters is enough to calculate information of the parent cluster. 



3.2 Algorithm 

We now describe the algorithm for maintaing maximum-matching in a tree using a top-tree. With 
each cluster P with boundary nodes, say, A and B we maintain the four values Mp*"'^, Mp^"'^, 
Mp^^"" and Mp^^^. 

For the base case, where each cluster, say P, consists of an edge {A,B), we have Mp"*"'-^ and 
are i. M^^""^ and M^-^""^ are invalid cases since the only edge {A, B) can either be matched 
or unmatched meaning either both A and B are either matched or unmatched. For the invalid case, 
we assign a special symbol null i.e. Mp^"'^ = null and Mq^^'^ = null. Any addition operation with 
a null value yields a null value and a maximum operation over a set consisting of null and non-null 
values should yield maximum value over non-null values if there are any and null otherwise. 

Let us now consider how we maintain the values Mp^^'^^u, v G {tf, h} when we have a rake or a 
compress operation. We have the following tables. The notation of clusters P, Q and R are same as 
shown in Figures 2 and 3. 

• Compress 



3. M^^"'^ 



ake 

2. M]^^''^ = maxiM^^""^ + M^^^^ , M^*"'^ + M]^^^^} 
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Figure 6: Top tree augmented with the information to maintain maximum-matching for the tree. 
Cardinahty of maximum-matching for this tree is 2 as this is the maximum-number among the four 
constrained matchings for the cluster at top-most level of the tree. 
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Figure 7: Updation of top-tree along with the augmented information when the edge {d, f) is added 
to the graph. The maximum-matching of the tree remains to be 2 as it is still the maximum number 
among the four constrained matching for the cluster at top-most level of the tree. 



3.3 Illustration 

Let us take an example to illustrate the algorithm (figure 6). We augment the top-tree of the earlier 
example with the information stored at each cluster. At each cluster Q, X and Y being the boundary 
nodes of Q, we store the tuple (M^^'"^, M^^''^, M^-^"'^, M^-^''^). For the purpose of illustration, we 
choose that boundary nodes as X such as X is situated vertically below Y in the figure. For example 
in (a, c)o, we choose X = a and Y = c as a occurs above c in the illustration. Further, we have 
represented null value by $. 

Now, on adding edge, {d,f), the top-tree is updated in the same manner (figure: 7) as it was 
updated earlier. In addition, we update the information stored at the clusters. 



3.4 Time Complexity and correctness 

The correctness of the algorithm can be inferred from two observations made in the section 3.1. The 
algorithm takes 0{log{n)) for updation since the top-tree updation takes 0(log(n)) time and the 
algorithm, in addition, does only a constant amount of work for each updated cluster. 

The query time to find the cardinality of maximum-matching is 0(1) since we simply need to 
find the maximum-number among the set of constrained matchings for the top-most cluster in the 
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3.5 Finding the matched edges 

We have so far presented an algorithm on how to maintain the cardinahty of maximum-matching. 
However, we have not shed hght on how to find whether a particular edge is matched or not. The 
answer to the question depends upon whether the tree in question has a unique maximum matching 
or not. For in case the tree in question has more than one maximum-mat chings then the question 
has either to be modified to asking whether a particular edge is matched in one of the several 
maximum-matchings or whether the edge is matched in the specified maximum-matching. 

We first present the algorithm for finding whether a particular edge is matched in atleast one 
among the several possible maximum-matchings. At each level of the top-tree, we have an ancestor 
of the edge in question. In this algorithm, we trace the path from the edge in question to the cluster 
at the top-most level in the top-tree via the ancestors of the edge. Each ancestor has four possible 
constrained matchings stored with it. Our algorithm chooses those constrained matchings of the 
ancestor which have the edge in question as matched. At the top most cluster of the top-tree, we 
check whether any of the chosen constrained matchings of the top-most cluster has cardinality equal 
to the maximum-matching. If yes, the edge is matched in atleast one of the possible maximum- 
matchings. If no, then the edge in question is not matched. 

We start from the edge in question and choose that constrained matching, among the four available 
constrained matchings at level 0, which makes the edge matched. We then go one level up and check 
which of the four constrained matchings stored at the parent cluster can be achieved by combining 
the choice of matching made at level with all possible valid matchings of the other child cluster of 
the parent. We include these achievable constrained maximum-matchings of the parent cluster in our 
choice set at level 1 since these constrained maximum-matchings make the edge in question matched. 
Iteratively, at level i we have a set of choices, which can be at maximum four in number as there 
are only four constrained matchings, and check which of these choices can lead to the constrained 
matchings stored at its parent at level i + 1 when combined through valid matchings of the other 
child cluster of the parent. Whichever of the constrained matchings of level i + 1 can be achieved are 
made as part of our choice at level i + At the cluster at the top-most level in the top tree, in case 
our choice set has non-empty intersection with the constrained matching of top-most cluster which 
lead to maximum-matching, then the edge in question is matched otherwise not. Further, in case at 
any level our choice set become empty, then the edge in question is not matched. 

This clearly takes 0{log{n)) time as there are 0{log{n)) levels in a top-tree and a constant amount 
of work is done at each level. Hence, to find whether an edge is matched among the potentially several 
maximum-matchings takes 0{log{n)) time. 

The above algorithm can be easily tweaked to find out whether a particular edge is matched or 
unmatched in the particular maximum matching where the matchings of say k edges are specified. 
Essentially, here for all the k edges we go up the tree and make choice sets, which confirm to the 
specified matchings of these edges, at ancestors of the k edges. Then we start from the edge in 
question and traverse up the top-tree and build the choice set where the edge in question is matched. 
If at any ancestor, the choice sets of the edge in question has zero intersection with the choice sets 
of the specified k edges, the edge is unmatched. Else, if at the top-most cluster, we have a non-zero 
intersection, the edge is matched. This again takes 0{k * log{n)) time. 

3.6 Tree with weighted edges 

The algorithm can be easily extended to tree with weighted edges by setting Mq*^^ = w[A,B) for 
all edge-clusters Q = {A, B) at level of the top-tree where w^A., B) specifies the weight of the edge 

(A 5). 
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4 Conclusion and Future Work 

This paper presented an 0{log{n)) update time fully-dynamic algorithm for maintaining maximum- 
matching in a tree. Future work on developing dynamic algorithms for maximum-matching for 
general graphs in particular bipartite graphs and planar graphs. 
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